Method and apparatus for obtaining photogrammetric data to estimate impact severity

ABSTRACT

In one embodiment, the present invention includes a computer-implemented method to collect information to determine damage to a vehicle involved in a collision using photogrammetric techniques. When determined, this vehicle damage information, which may be in the form of crush measurement information such as a crush damage profile, can be displayed in a computer-generated view of the subject vehicle with a crush damage profile and used to estimate the impact severity. In some embodiments, based on the photogrammetric information derived, a direction of any shifting of the vehicle&#39;s components may be obtained and used along with other information to estimate a principal direction of force (PDOF) for one or more vehicles involved in the collision. Still further embodiments may be used to generate and/or audit repair estimates based at least in part on the photogrammetric information.

This application claims priority to U.S. Provisional Patent Application No. 60/811,964 filed on Jun. 8, 2006 in the name of Scott D. Kidd and Darrin A. Smith entitled METHOD AND APPARATUS FOR OBTAINING PHOTOGRAMMETRIC DATA TO ESTIMATE IMPACT SEVERITY.

BACKGROUND

Embodiments of the present invention relate to vehicular accident analysis and specifically to the collection and analysis of information for using the principles of photogrammetry to estimate vehicle damage and impact severity.

Organizations such as insurance companies and others are tasked with investigating accidents to resolve property and injury claims. Part of the investigation is to determine the severity and direction of the impact. A review of repair estimates and photos is usually done to develop a qualitative assessment of impact severity. If there is damage beyond the bumper of the vehicle(s) (i.e., there is crush damage to the vehicle), the vehicle is often given the qualitative assessment of a significant impact based on a subjective review of the damage information. As much as 40% of accidents of a low severity are subjectively analyzed as a high severity impact, primarily because of the damage to the vehicle(s).

One solution to determining crush damage is to measure the crush when the vehicle is examined for damage and use that information to determine impact severity. The measurement of crush requires some training to understand the concepts and insure consistency, and is also time consuming. With high turnover in the insurance industry and a desire to improve operational efficiency, this creates an ongoing and potentially expensive effort.

Alternately, organizations can retain forensic experts such as engineers and accident reconstructionists to evaluate accident information, reconstruct the accident scenario and characteristics including the determination of the magnitude and direction of the impact. This is an expensive and non-timely solution to be used on wide-scale basis.

The current solution for determining the components and operations required to fix a damaged vehicle is to visually inspect the vehicle. A list of components is created identifying which should be repaired or replaced. This visual inspection process frequently requires a second or more inspection to correct errors in the first inspection. This is a labor intensive and inefficient process. The current process does not leverage the information from similar impacts to similar vehicles.

The Principal Direction of Force (PDOF) is the main axis along which the force of the impact acts on the vehicle. The PDOF is a factor in both the determining of injury potential of the accident and determining which components of the vehicle are damaged. The current method of determining PDOF is to examine photos, descriptions of the accident and possibly scene information to determine the final resting locations of the vehicles. In low to moderate severity impacts, the final resting locations of the vehicles are either not particularly relevant or not available. The current evaluation of PDOF is done by forensic experts which is an expensive and time consuming process.

SUMMARY

In one aspect, the present invention includes a computer-implemented method to collect information to determine vehicle damage by photogrammetry. A computer system is used to send pictures and camera settings from one computer to another computer to determine vehicle crush damage via photogrammetric techniques. Cameras used to take pictures for use in the program may be identified and their characteristics stored in a central location for use when photos taken by the camera are sent to the central location. The camera and pictures can be selected for transfer from one computer to another computer via computer network, wireless transmission, and the like.

In another aspect, a computer-generated selection of vehicle poses is presented via computer. The selection of the pose determines the angle at which the photo was taken and sets the frame of reference. The pose can also be determined by placing optical markers on the vehicle at the time the photos are taken. The optical markers can help detect points on the vehicle such as the tires, and can thus aid in determining the pose automatically via the computer analysis of the photos.

Still in another aspect, a computer-generated selection of control points is presented via computer. Standard points on the vehicle that were not damaged in the subject accident are marked in each of the photos of the subject vehicle. These standard points may allow for comparison between different poses of the vehicle to allow triangulation of the points between the different photos. The control points can also be determined by placing optical markers on the vehicle at the time the photos are taken. The optical markers can help detect points on the vehicle such as the windshield, and refine the pose automatically via computer analysis of the photos.

In another aspect, a computer-generated grid of vehicle damage points is presented and selected for the subject vehicle. The damage points allow for comparison between different poses of the vehicle to allow the triangulation of the points between the different photos. The damage points can also be determined by placing optical markers on the vehicle at the time the photos are taken. The optical markers can help to detect points on the vehicle such as the head lights or tail lights, and can thus aid in determining the damage automatically via the computer analysis of the photos.

In another aspect, a computer-generated crush damage profile may be calculated for the subject vehicle using photogrammetic techniques and comparing the points of the damaged vehicle generated by photogrammetry to an undamaged vehicle. The results may be displayed in a computer-generated view of the subject vehicle with a crush damage profile and used to estimate the impact severity.

In another aspect, a computer-implemented method of determining the direction of shifting of the vehicle's components may be provided. This information can be combined with information about pre-accident actions and questions regarding the pre-accident vehicle speeds via computer to estimate the Principal Direction of Force (PDOF) for the vehicles involved in the collision.

In another aspect, the depth of damage and direction of impact to the vehicle can be compared with similar vehicles of similar depth of damage and direction of impact with respect to the components required to repair the vehicle. The comparison can be used to generate a list of components that may need to be repaired or replaced to restore the vehicle. The comparison can alternatively be used to audit repair estimates that were developed independently, in some implementations.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flow diagram of a method to capture and store camera calibration characteristics in a central location in accordance with one embodiment of the present invention.

FIG. 2 is a flow diagram of a method of sending picture and camera information to a central location to conduct a photogrammetric analysis of vehicle damage in accordance with one embodiment of the present invention.

FIG. 3 is a diagram defining object, camera and image reference frames.

FIG. 4 is a diagram defining the relationship between the camera reference frame and the image plane.

FIG. 5 shows the principal point offset.

FIG. 6 shows an example of a planar calibration pattern.

FIGS. 7 A-C show examples of an undistorted image and lens distorted images.

FIG. 8 is a conceptual diagram of the transformation from the object to the camera space.

FIG. 9 is a diagram defining the error in image space and object space.

FIG. 10 is a flow diagram of a method of evaluating photos using photogrammetry techniques in accordance with one embodiment of the present invention.

FIG. 11 is a diagram showing triangulation from two views.

FIG. 12 is a conceptual diagram depicting rays back projected from imperfectly measured image points.

FIG. 13 is a diagram showing the concept of bundle adjustment.

FIG. 14 is a diagram showing the concept of comparing a damaged vehicle with an exemplar vehicle.

FIG. 15 is a flow diagram of a method of creating a crush damage profile using photogrammetry techniques in accordance with one embodiment of the present invention.

FIG. 16 is a flow diagram of a method of determining PDOF from a crush damage profile generated via photogrammetry in accordance with one embodiment of the present invention.

FIG. 17 is a flow diagram of a method of estimating component damage for repairing a vehicle from a crush damage profile generated via photogrammetry in accordance with one embodiment of the present invention.

FIG. 18 is a block diagram of a system in accordance with one embodiment of the present invention.

DETAILED DESCRIPTION

Photos of vehicles damaged in a collision can be taken to determine the extent of damage. These photos then can be used to estimate the amount of deformation to the vehicle and determine the impact severity for the vehicle. Impact severity can be used to determine the possible injury potential for the accident or to determine what components should be repaired or replaced to repair the vehicle.

Photogrammetry is a measurement technology in which the three-dimensional coordinates of points on an object are determined by measurements made in two or more photographic images taken from different positions. Embodiments of the present invention use a photogrammetric system for measuring vehicular crush from photographs. This system can be broken down into four steps: (1) camera calibration; (2) camera pose estimation; (3) triangulation; and (4) bundle adjustment.

Camera calibration is the process for identifying an individual camera's geometric and optical characteristics, so that metric information can be obtained from its images. A camera's characteristics can be divided into two categories—extrinsic parameters and intrinsic parameters. Extrinsic parameters refer to the spatial relationship between the camera and the object of interest. Intrinsic parameters refer to the camera's optical characteristics.

The process of determining a camera's position and aiming direction (or orientation) from known XYZ coordinates of the object is called resection. In computer vision literature, this is also known as the exterior orientation problem or camera pose estimation problem. The known XYZ coordinates of the object are called control points.

Photogrammetry uses a principle called triangulation to determine an object's three-dimensional coordinates from multiple photographs. Points are triangulated by finding the intersection of converging lines of sight, or rays. By taking photographs of the object from at least two different locations and measuring the same target in each photograph, lines of sight are defined from each camera position to the target. If the camera positions and orientations are known, then these rays can be intersected to find the 3D coordinate in object space.

Bundle adjustment is the process of refining a visual reconstruction to produce jointly optimal structure (3D feature points of an object) and motion (camera pose).

Different techniques for obtaining data and analyzing the obtained data can be used to employ photogrammetric algorithms to determine the extent of damage to a vehicle. Referring now to FIG. 1, shown is one embodiment of a process 100 of collecting camera calibration information. In one embodiment, the information may be provided from the camera to a central location for determining characteristic information and storing the information for later use. As shown in FIG. 1, a known calibration object is photographed to determine the intrinsic errors in the camera itself, as shown in block 105. Control is passed to decision diamond 110 to determine if the calibration points are to be selected manually from the calibration photo. If the calibration points are to be selected manually from the calibration photo, then control is passed to block 115 to allow the calibration points to be selected manually. Once the calibration points are identified, control is passed to block 120 to send the calibration photos for evaluation, e.g., at a central location. Then control passes to block 125 to determine the camera's internal characteristics and correction. If automatic calibration is to be used, control is passed from decision diamond 110 to block 120 to send the photos for evaluation. Then control is passed to block 125 to determine the camera's internal characteristics and correction factors. Once the optical characteristics for a camera are known, the camera can be identified and its parameters stored for future reference as shown in block 130.

Referring now to FIG. 2, a flow diagram shows a process 200 of taking photos and sending a camera selection to a central location as one embodiment of the present invention. As shown in block 205, photos can be taken with markers to indicate points on the damaged vehicle or without markers. The photos can be sent via electronic means, such as a computer-to-computer communication to a central location as shown in block 210. As shown in block 215, information regarding the camera used to take the photos can be selected and sent via computer to a central location. Once the photos are received in the central location, control is passed to decision diamond 220 to evaluate the camera settings file that accompanies the photos. If the settings with the photos match the settings when the camera was calibrated (e.g., in accordance with the method of FIG. 1), then control is passed to block 230 and the photos are accepted for further evaluation. If the settings are different, then the person who took the photos is notified via electronic means (e.g., email) or via phone to recalibrate their camera to reflect the new settings (block 225) or send new photos with a previously calibrated and unaltered camera. Control is passed to decision diamond 240 to determine whether new calibration settings have been sent. If new calibration settings are sent, then the control is passed to block 235 and the new camera characteristics are stored. Control is subsequently passed to block 230. If new recalibration settings are not sent, then control passes to decision diamond 245 to evaluate whether new pictures have been sent. If new pictures have been sent, then control passes to decision diamond 220 to evaluate the camera settings file that accompanies the photos and the process of camera settings comparison repeats as discussed previously. If new photos are not sent, then control passes to block 250 and photos are not accepted for use in further evaluation.

FIG. 3 shows the coordinate reference frames of the object, O, the camera, C, and the image, I. The extrinsic parameters map 3D coordinates in the object reference frame to 3D coordinates in the camera reference frame. Likewise, the intrinsic parameters map the 3D coordinates in the camera reference frame to the projected 2D coordinates in the image plane.

Given a 3D point, P_(o), in the object's reference frame, its coordinates in the camera reference frame are given by: P _(c) =RP _(o) +T _(o) where R is a 3×3 rotation matrix and T_(o) is the position of the object reference frame with respect to the camera. This can also be written as:

$P_{C} = {{{RP}_{O} - {RT}_{C}} = {\begin{bmatrix} R & {- {RT}_{C}} \\ 0 & 1 \end{bmatrix}\begin{Bmatrix} P_{O} \\ 1 \end{Bmatrix}}}$ where T_(c) is the position of the camera reference frame with respect to the object.

FIG. 4 shows the relationship between the camera reference frame and the image plane. By way of similar triangles, it can be shown that the point, P_(c), with coordinates (X,Y,Z) in the camera reference frame, projects to (fX/Z, fY/Z) on the image plane. Using homogenous coordinates, this can be expressed as the matrix multiplication:

$P_{IC} = {\begin{Bmatrix} {fX} \\ {fY} \\ Z \end{Bmatrix} = {\begin{bmatrix} f & \; & \; & 0 \\ \; & f & \; & 0 \\ \; & \; & 1 & 0 \end{bmatrix}\begin{Bmatrix} X \\ Y \\ Z \\ 1 \end{Bmatrix}}}$ where P_(IC) indicates that the projected point is expressed with respect to the principal point of the image. Usually, the image coordinate system is offset from the principal point as shown in FIG. 5.

In non-homogenous coordinates, the image point is (fX/Z+U_(o), fY/Z+V_(o)), which can be written as a matrix multiplication using homogenous coordinates, i.e.,

$P_{I} = {\begin{Bmatrix} {{fX} + {ZU}_{O}} \\ {{fY} + {ZV}_{O}} \\ Z \end{Bmatrix} = {\begin{bmatrix} f & \; & U_{O} & 0 \\ \; & f & V_{O} & 0 \\ \; & \; & 1 & 0 \end{bmatrix}\begin{Bmatrix} X \\ Y \\ Z \\ 1 \end{Bmatrix}}}$ Now, writing

$K = \begin{bmatrix} f & \; & U_{O} \\ \; & f & V_{O} \\ \; & \; & 1 \end{bmatrix}$ Here, the matrix K is called the camera calibration matrix.

The entire mapping from object reference frame to image plane can be written as:

$P_{I} = {\text{[}K\mspace{20mu}\overset{->}{0}{\text{]}\begin{bmatrix} R & {- {RT}_{C}} \\ 0 & 1 \end{bmatrix}}\begin{Bmatrix} P_{O} \\ 1 \end{Bmatrix}}$ Or, more succinctly as:

$P_{I} = {{{K\left\lbrack {R\mspace{20mu} t} \right\rbrack}\begin{Bmatrix} P_{O} \\ 1 \end{Bmatrix}\mspace{20mu}{where}\mspace{14mu} t} = {- {RT}_{C}}}$ It should be noted that the intrinsic parameters are totally contained in the camera calibration matrix, K, while the extrinsic parameters are described by the [R t] matrix.

The pinhole camera model assumes image coordinates are Euclidean with equal scales in both directions. In the case of digital cameras, it is possible to have non-square pixels. It is also possible, albeit unlikely, that the pixels aren't perpendicular, i.e., the pixels are skew. In this case, the camera calibration matrix, K, can have the more general form:

$K = \begin{bmatrix} f_{X} & s & U_{O} \\ \; & f_{Y} & V_{O} \\ \; & \; & 1 \end{bmatrix}$ The terms f_(x) and f_(y) account for non-square pixels and s is the skew parameter. In most cases, the skew term will be zero.

The calibration procedure may begin by taking several images of a planar calibration pattern with precisely known geometry. For example, in one embodiment a calibration pattern may be a checkerboard pattern with regularly spaced rectangles or squares such as shown in FIG. 6.

If the object reference frame is chosen such that the XY plane is the plane (Z=0), then the relationship between image coordinates and object coordinates can be expressed as:

$P_{I} = {{{K\left\lbrack {r_{1}\mspace{20mu} r_{2}\mspace{20mu} r_{3}\mspace{20mu} t} \right\rbrack}\begin{Bmatrix} P_{Ox} \\ P_{Oy} \\ 0 \\ 1 \end{Bmatrix}} = {{K\left\lbrack {r_{1}\mspace{20mu} r_{2}\mspace{20mu} t} \right\rbrack}\begin{Bmatrix} P_{Ox} \\ P_{Oy} \\ 1 \end{Bmatrix}\mspace{14mu}{or}}}$ $P_{I} = {H\begin{Bmatrix} P_{Ox} \\ P_{Oy} \\ 1 \end{Bmatrix}}$ Here, the 3×3 matrix, H, is called a planar homography and it maps points from the calibration plane to their corresponding image coordinates. Given an image of the calibration pattern, this homography can be estimated.

Denoting the homography H=[h₁ h₂ h₃] gives [h₁ h₂ h₃]=K[r₁ r₂ t] Since r1 and r2 are orthogonal, it can be shown that h ₁ ^(T) K ^(−T) K ⁻¹ h ₂=0 h ₁ ^(T) K ^(−T) K ⁻¹ h ₁ −h ₂ ^(T) K ^(−T) K ⁻¹ h ₂=0 Defining ω as ω=K^(−T)K⁻¹ gives h₁ ^(T)ωh₂=0 h ₁ ^(T) ωh ₁ −h ₂ ^(T) ωh ₂=0 Here, ω is known as the image of the absolute conic.

In terms of the calibration matrix, ω can be expressed as:

$\omega = {{K^{- T}K^{- 1}} = {\begin{bmatrix} \omega_{11} & \omega_{12} & \omega_{13} \\ \omega_{21} & \omega_{22} & \omega_{23} \\ \omega_{31} & \omega_{32} & \omega_{33} \end{bmatrix} = \begin{bmatrix} \frac{1}{f_{x}^{2}} & 0 & {- \frac{U_{0}}{f_{x}^{2}}} \\ 0 & \frac{1}{f_{y}^{2}} & {- \frac{V_{0}}{f_{x}^{2}}} \\ {- \frac{U_{0}}{f_{x}^{2}}} & {- \frac{V_{0}}{f_{y}^{2}}} & \left( {\frac{U_{0}^{2}}{f_{x}^{2}} + \frac{V_{0}^{2}}{f_{y}^{2}} + 1} \right) \end{bmatrix}}}$ Since ω is symmetric, it can be represented by the 6 vector {right arrow over (ω)}×=[ω₁₁ ω₁₂ ω₂₂ ω₁₃ ω₂₃ ω₃₃]^(T)

Using the estimated homography of each calibration image and the constraints on the calibration matrix, a set of linear equations can be written in {right arrow over (ω)}, i.e. h_(i) ^(T)ωh_(j)=v_(ij) ^(T){right arrow over (ω)} where v_(ij)=└h_(i1)h_(j1), h_(i1)h_(j2)+h_(i2)h_(j1), h_(i2)h_(j2), h_(i3)h_(j1)+h_(il)h_(j3), h_(i3)h_(j2)+h_(i2)h_(j3), h_(i3)h_(j3)┘ Therefore, the constraints on each homography result in the following linear system

${\begin{bmatrix} v_{12}^{T} \\ \left( {v_{11} - v_{22}} \right)^{T} \end{bmatrix}\overset{->}{\omega}} = {{V\;\overset{->}{\omega}} = \overset{->}{0}}$ For n images, the matrix V is a 2n×6 matrix. This system can be solved if there are at least 3 images of the calibration plane.

The pinhole camera model is an idealized camera model. Real cameras have imperfect lenses that produce nonlinear effects. For example, when the magnification of a lens differs at its edges and at its center, the image of a square object will be distorted. If the magnification is lower at the edges than at the center, a square object will appear to have rounded edges. This type of distortion is called “barrel” distortion. If the magnification is greater at the edges than at the center, the image will exhibit “pincushion” distortion. FIGS. 7 A-C illustrate both of these radial distortion effects, along with an undistorted image.

Radial distortion can be corrected using a nonlinear distortion factor. Studies have shown that the distortion can be modeled as a polynomial with respect to the squared radial distance, i.e.,

$\begin{bmatrix} {\delta\; u} \\ {\delta\; v} \end{bmatrix} = \begin{bmatrix} {\overset{\sim}{u}\left( {{k_{1}r^{2}} + {k_{2}r^{4}} + \ldots}\mspace{11mu} \right)} \\ {\overset{\sim}{v}\left( {{k_{1}r^{2}} + {k_{2}r^{4}} + \ldots}\mspace{11mu} \right)} \end{bmatrix}$ where (δu,δv)is the amount of distortion in the x and y-directions, respectively,(ũ,{tilde over (v)})is an image point projected via the pinhole camera model, k₁, k₂, . . . are the distortion coefficients, and r=√{square root over (ũ²+{tilde over (v)}²)} is the radius.

Thus, the pinhole camera model can be extended to include nonlinear distortion, i.e.,

$\begin{bmatrix} u \\ v \end{bmatrix} = {{\begin{bmatrix} {f_{X}\left( {\overset{\sim}{u} + {\delta\; u}} \right)} \\ {f_{Y}\left( {\overset{\sim}{v} + {\delta\; v}} \right)} \end{bmatrix} + {\begin{bmatrix} U_{0} \\ V_{0} \end{bmatrix}\begin{bmatrix} u \\ v \end{bmatrix}}} = {\begin{bmatrix} {f_{X}{\overset{\sim}{u}\left( {1 + {k_{1}r^{2}} + {k_{2}r^{4}}} \right)}} \\ {f_{Y}{\overset{\sim}{v}\left( {1 + {k_{1}r^{2}} + {k_{2}r^{4}}} \right)}} \end{bmatrix} + \begin{bmatrix} U_{0} \\ V_{0} \end{bmatrix}}}$ It has been shown that radial distortion is adequately modeled by the first two polynomial terms; thus, the higher order terms have been dropped.

To calculate the optimal camera calibration parameters, the problem is posed as a nonlinear minimization problem. Given n images of a calibration target with m points, the objective is to minimize the reprojection error, i.e.,

$f = {\sum\limits_{i = 1}^{n}{\sum\limits_{j = 1}^{m}{{m_{ij} - {\overset{\Cup}{m}\left( {K,k_{1},k_{2},R_{i},t_{i},P_{j}} \right)}}}}}$ where {hacek over (m)}(K,k₁, k₂, R_(i), t_(i), P_(j)) is the projection of point P_(j) in image i. This nonlinear minimization problem is solved via the Levenberg-Marquardt Algorithm.

Of the non-linear least squares algorithms, the Levenberg-Marquardt (LM) algorithm has been the most popular because of its tolerance to missing data and its convergence properties. The basis for the LM algorithm is a linear approximation of the objective function in the neighborhood of the control variables. For a small perturbation of the control variables, δ_(p), a Taylor series expansion yields a linear approximation for the residual, i.e., f(p+δ _(p))≈f(p)+Jδ _(p) The gradient of this form of the objective function is given by g=J ^(T) f(p+δ _(p))=J ^(T) [f(p)+Jδ _(p)] Setting the gradient to zero yields the so-called normal equations, i.e., J ^(T) Jδ _(p) =J ^(T) f(p) Solving this linear system gives the δ_(p) that is a local minimizer of the objective function.

The LM algorithm uses a slightly different version of these equations called the augmented normal equations, i.e., Nδ _(p) =J ^(T) f(p) The off-diagonal elements of the matrix N are the same as the corresponding elements of the J^(T)J matrix, but the diagonal elements of N are given by N _(ii) =μ+└J ^(T) J┘ _(ii) for some μ>0.

This strategy of altering the diagonal elements is called damping and the factor, μ, is called the damping term. If the solution of the normal equations yields a new parameter estimate that reduces the value of the objective function, the update is accepted and the process is repeated with a decreased damping term. Otherwise, the damping term is increased and the normal equations are solved again until the objective function is decreased. In a single iteration of the LM algorithm, the normal equations are solved until an acceptable parameter estimate is found.

The augmented normal equations can also be written as the following linear system: (H+μI)δ_(p) =g where H is the Gauss-Newton approximation of the Hessian, J^(T)J, and I is the identity matrix with the same size as H.

The LM algorithm terminates when one of three stopping criteria are met: (1) The norm of the gradient of the residual, i.e. j^(T)f(p), falls below a threshold, ε₁; (2) The relative change in the step size falls below a threshold, ε₂; and (3) The number of LM iterations exceeds some maximum value, k_(max).

Table 1 shows pseudocode for a LM algorithm in accordance with one embodiment of the present invention.

TABLE 1 iterations = 0; p = p0; residual = x − f(p); error = | residual |; H = J^(T)J; // Hessian (Gauss-Newton approximation) g = J^(T) residual; // Gradient Converged = ( |Grad| < el ); μ = maximum diagonal element of Hessian; while (not converged) and (iterations < max_iterations) accept_step = false; while (not accept_step) and (not converged) // Solve normal equations step = solution  of  (H + μI)δ_(p) = g // Evaluate error, Gradient, and Hessian at new step p_new = p + step; residual_new = x − f(p_new); error_new = | residual_new |; H_new = J_(new) ^(T)J_(new); g_new = J_(new) ^(T) residual_new; // Check step against threshold if ( |step| < e2|p| ) converged = true; else // Check acceptability of step accept_step = error_new < error; if (accept_step) // Accept step. // Update control variables, error, // Hessian, Gradient, and damping parameter. p = p_new; error = error_new; H = H_new; g = g_new; μ = μ / 10; else // Reject step and update damping parameter. μ = μ * 10; end; end; endwhile // Convergence test based on Gradient norm convergence = ( |Grad| < el) // Update iteration count iterations = iterations + 1; endwhile

This section describes a technique whereby the structure of the Jacobian matrix can be exploited to reduce the overall complexity of a system and greatly improve the computational performance.

For illustrative purposes, the following will be applied to bundle adjustment; however, the technique can be applied to camera calibration, the pose estimation problem, and many other computer vision problems.

Assume that n 3D points are visible in m images. Let {circumflex over (x)}_(ij) represent the projection of point i on image j. Let a_(j) represent the control variables of each camera j and let b_(i) represent the control variables for each 3D point i.

For n points in m images, the observed image points are x=(x ₁₁ ^(T) , . . . , x _(1m) ^(T) , . . . , x ₂₁ ^(T) , . . . , x _(n1) ^(T) , . . . , x _(nm) ^(T))^(T) Similarly, the estimated (projected) image points of the n 3D points are given by {circumflex over (x)}=({circumflex over (x)} ₁₁ ^(T) , . . . , {circumflex over (x)} _(1m) ^(T) , . . . , {circumflex over (x)} _(2l) ^(T) , . . . , {circumflex over (x)} _(2m) ^(T) , . . . , {circumflex over (x)} _(n1) ^(T) , . . . , {circumflex over (x)} _(nm) ^(T))^(T) where each {circumflex over (x)}_(ij)=Q(a_(j),b_(i)) is a predicted image point from a mathematical camera model, e.g. pinhole projection model. The error or residual vector is defined as ε=x−{circumflex over (x)}

The control variables are partitioned by the vector P=(a ₁ ^(T) , . . . , a _(m) ^(T) , b ₁ ^(T) , . . . , b _(m) ^(T))^(T) For simplicity, the Jacobian for n=4 points in m=3 views is given by

$J = {\frac{\partial{\hat{x}(P)}}{\partial P} = \begin{pmatrix} A_{11} & 0 & 0 & B_{11} & 0 & 0 & 0 \\ 0 & A_{12} & 0 & B_{12} & 0 & 0 & 0 \\ 0 & 0 & A_{13} & B_{13} & 0 & 0 & 0 \\ A_{21} & 0 & 0 & 0 & B_{21} & 0 & 0 \\ 0 & A_{22} & 0 & 0 & B_{22} & 0 & 0 \\ 0 & 0 & A_{23} & 0 & B_{23} & 0 & 0 \\ A_{31} & 0 & 0 & 0 & 0 & B_{31} & 0 \\ 0 & A_{32} & 0 & 0 & 0 & B_{32} & 0 \\ 0 & 0 & A_{33} & 0 & 0 & B_{33} & 0 \\ A_{41} & 0 & 0 & 0 & 0 & 0 & B_{41} \\ 0 & A_{42} & 0 & 0 & 0 & 0 & B_{42} \\ 0 & 0 & A_{43} & 0 & 0 & 0 & B_{43} \end{pmatrix}}$ where $A_{ij} = {{\frac{\partial{\hat{x}}_{ij}}{\partial a_{j}}\mspace{14mu}{and}\mspace{14mu} B_{ij}} = \frac{\partial{\hat{x}}_{ij}}{\partial b_{i}}}$

The first m columns of the Jacobian are the partial derivatives of the image residuals with respect to the parameters of camera j. Since the camera parameters for one image do not affect the projected image points of other images, there are numerous zeros in these columns. Similarly, the last n columns of the Jacobian are the partial derivatives of the image residuals with respect to the 3D structure parameters. These columns also have numerous zeros because of the lack of interaction between parameters. Reconsider the normal equations, i.e., J^(T)Jδ=J^(T)ε The left hand side of the normal equations is given by

${J^{T}J} = \begin{pmatrix} {\sum\limits_{i}{A_{i\; 1}^{T}A_{i\; 1}}} & 0 & 0 & {A_{11}^{T}B_{11}} & {A_{21}^{T}B_{21}} & {A_{31}^{T}B_{31}} & {A_{41}^{T}B_{41}} \\ 0 & {\sum\limits_{i}{A_{i\; 2}^{T}A_{i\; 2}}} & 0 & {A_{12}^{T}B_{12}} & {A_{22}^{T}B_{22}} & {A_{32}^{T}B_{32}} & {A_{42}^{T}B_{42}} \\ 0 & 0 & {\sum\limits_{i}{A_{i\; 3}^{T}A_{i\; 3}}} & {A_{13}^{T}B_{13}} & {A_{23}^{T}B_{23}} & {A_{33}^{T}B_{33}} & {A_{43}^{T}B_{43}} \\ {B_{11}^{T}A_{11}} & {B_{12}^{T}A_{12}} & {B_{13}^{T}A_{13}} & {\sum\limits_{j}{B_{1\; j}^{T}B_{1\; j}}} & 0 & 0 & 0 \\ {B_{21}^{T}A_{21}} & {B_{22}^{T}A_{22}} & {B_{33}^{T}A_{33}} & 0 & {\sum\limits_{j}{B_{2\; j}^{T}B_{2\; j}}} & 0 & 0 \\ {B_{31}^{T}A_{31}} & {B_{32}^{T}A_{32}} & {B_{33}^{T}A_{33}} & 0 & 0 & {\sum\limits_{j}{B_{3\; j}^{T}B_{3\; j}}} & 0 \\ {B_{41}^{T}A_{41}} & {B_{42}^{T}A_{42}} & {B_{43}^{T}A_{43}} & 0 & 0 & 0 & {\sum\limits_{j}{B_{4\; j}^{T}B_{4\; j}}} \end{pmatrix}$ And the right hand side is given by

${J^{T}ɛ} = \begin{pmatrix} {\sum\limits_{i}{A_{i\; 1}^{T}ɛ_{i\; 1}}} \\ {\sum\limits_{i}{A_{i\; 2}^{T}ɛ_{i\; 2}}} \\ {\sum\limits_{i}{A_{i\; 3}^{T}ɛ_{i\; 3}}} \\ {\sum\limits_{j}{B_{1j}^{T}ɛ_{1j}}} \\ {\sum\limits_{j}{B_{2j}^{T}ɛ_{2j}}} \\ {\sum\limits_{j}{B_{3j}^{T}ɛ_{3j}}} \\ {\sum\limits_{j}{B_{4j}^{T}ɛ_{4j}}} \end{pmatrix}$

Substituting U_(j), V_(i), W_(ij), ε_(aj) and ε_(bi) for

${\sum\limits_{i}{A_{ij}^{T}A_{ij}}},{\sum\limits_{j}{B_{ij}^{T}B_{ij}}},{A_{ij}^{T}B_{ij}},{\sum\limits_{i}{A_{ij}^{T}ɛ_{ij}\mspace{14mu}{and}\mspace{14mu}{\sum\limits_{j}{B_{ij}^{T}ɛ_{ij}\mspace{14mu}{respectively}}}}},$ the normal equations can be written in a more compact form, i.e.,

${\begin{pmatrix} U_{1} & 0 & 0 & W_{11} & W_{21} & W_{31} & W_{41} \\ 0 & U_{2} & 0 & W_{12} & W_{22} & W_{32} & W_{42} \\ 0 & 0 & U_{3} & W_{13} & W_{23} & W_{33} & W_{43} \\ W_{11}^{T} & W_{12}^{T} & W_{13}^{T} & V_{1} & 0 & 0 & 0 \\ W_{21}^{T} & W_{22}^{T} & W_{23}^{T} & 0 & V_{2} & 0 & 0 \\ W_{31}^{T} & W_{32}^{T} & W_{33}^{T} & 0 & 0 & V_{3} & 0 \\ W_{41}^{T} & W_{42}^{T} & W_{43}^{T} & 0 & 0 & 0 & V_{4} \end{pmatrix}\begin{pmatrix} \delta_{a\; 1} \\ \delta_{a\; 1} \\ \delta_{a\; 1} \\ \delta_{b\; 1} \\ \delta_{b\; 2} \\ \delta_{b\; 3} \\ \delta_{b\; 4} \end{pmatrix}} = \begin{pmatrix} ɛ_{a\; 1} \\ ɛ_{a\; 2} \\ ɛ_{a\; 3} \\ ɛ_{b\; 1} \\ ɛ_{b\; 2} \\ ɛ_{b\; 3} \\ ɛ_{b\; 4} \end{pmatrix}$ The normal equations can be written in even more compact form, i.e.,

${{{\begin{pmatrix} U^{*} & W \\ W^{T} & V^{*} \end{pmatrix}\begin{pmatrix} \delta_{a} \\ \delta_{b} \end{pmatrix}} = {{\begin{pmatrix} ɛ_{a} \\ ɛ_{b} \end{pmatrix}\mspace{14mu}{where}\mspace{14mu} U^{*}} = \begin{pmatrix} U_{1}^{*} & 0 & 0 \\ 0 & U_{2}^{*} & 0 \\ 0 & 0 & U_{3}^{*} \end{pmatrix}}},{V^{*} = \begin{pmatrix} V_{1}^{*} & 0 & 0 & 0 \\ 0 & V_{2}^{*} & 0 & 0 \\ 0 & 0 & V_{3}^{*} & 0 \\ 0 & 0 & 0 & V_{4}^{*} \end{pmatrix}},{{{and}\mspace{14mu} W} = \begin{pmatrix} W_{11} & W_{21} & W_{31} & W_{41} \\ W_{12} & W_{22} & W_{32} & W_{42} \\ W_{13} & W_{23} & W_{33} & W_{43} \end{pmatrix}}}{The}\mspace{14mu}*{indicates}\mspace{14mu}{that}\mspace{14mu}{the}\mspace{14mu}{diagonal}\mspace{14mu}{elements}\mspace{14mu}{are}\mspace{14mu}{augmented}\mspace{14mu}{for}$ the  LM  algorithm.

Multiplying the compact form of the augmented normal equations by

$\begin{pmatrix} I & {{- W}\; V^{*{- 1}}} \\ W^{T} & V^{*} \end{pmatrix}\quad$ results in

${\begin{pmatrix} {U^{*} - {W\; V^{*{- 1}}W^{T}}} & 0 \\ W^{T} & V^{*} \end{pmatrix}\begin{pmatrix} \delta_{a} \\ \delta_{b} \end{pmatrix}} = \begin{pmatrix} {ɛ_{a} - {W\; V^{*{- 1}}ɛ_{b}}} \\ ɛ_{b} \end{pmatrix}$

Since the upper right block of the left hand matrix is zero, the δ_(a) vector can be determined by solving the upper j set of equations, i.e., (U*−WV* ⁻¹ W ^(T))δ_(a)=ε_(a) −WV* ⁻¹ε_(b) After solving for δ_(a), δ_(b) can be solved by backsubstitution into the bottom i set of equations. Denoting Y_(ij)=W_(ij)V*_(i) ⁻¹, the upper j set of equations becomes

${\begin{pmatrix} {U_{1}^{*} - {\sum\limits_{i}{Y_{i\; 1}W_{i\; 1}^{T}}}} & {- {\sum\limits_{i}{Y_{i\; 1}W_{i\; 2}^{T}}}} & {- {\sum\limits_{i}{Y_{i\; 1}W_{i\; 3}^{T}}}} \\ {- {\sum\limits_{i}{Y_{i\; 2}W_{i\; 1}^{T}}}} & {U_{2}^{*} - {\sum\limits_{i}{Y_{i\; 2}W_{i\; 2}^{T}}}} & {- {\sum\limits_{i}{Y_{i\; 2}W_{i\; 3}^{T}}}} \\ {- {\sum\limits_{i}{Y_{i\; 3}W_{i\; 1}^{T}}}} & {- {\sum\limits_{i}{Y_{i\; 3}W_{i\; 2}^{T}}}} & {U_{3}^{*} - {\sum\limits_{i}{Y_{i\; 3}W_{i\; 3}^{T}}}} \end{pmatrix}\begin{pmatrix} \delta_{a\; 1} \\ \delta_{a\; 2} \\ \delta_{a\; 3} \end{pmatrix}} = \begin{pmatrix} {ɛ_{a\; 1} - {\sum\limits_{i}Y_{i\; 1}} - ɛ_{bi}} \\ {ɛ_{a\; 2} - {\sum\limits_{i}Y_{i\; 2}} - ɛ_{bi}} \\ {ɛ_{a\; 3} - {\sum\limits_{i}Y_{i\; 3}} - ɛ_{bi}} \end{pmatrix}$ which can be solved for δ_(a). Each δ_(bi) is then given by

$\delta_{bi} = {V_{i}^{*{- 1}}\left( {ɛ_{bi} - {\sum\limits_{j}{w_{ij}^{T}\delta_{aj}}}} \right)}$ Table 2 summarizes the technique, which can be generalized to n points in m views.

TABLE 2 1. Compute the derivative matrices, ${A_{ij} = {{\frac{\partial{\hat{x}}_{ij}}{\partial a_{j}}\mspace{20mu}{and}\mspace{20mu} B_{ij}} = \frac{\partial{\hat{x}}_{ij}}{\partial b_{i}}}},$ and error vector ε_(ij) = x_(ij) − {circumflex over (x)}_(ij) 2. Compute the intermediate expressions $\begin{matrix} {U_{j} = {\sum\limits_{i}{A_{ij}^{T}A_{ij}}}} & {\mspace{11mu}{V_{i} = {\sum\limits_{j}{B_{ij}^{T}B_{ij}}}}} & {{W_{ij} = {A_{ij}^{T}B_{ij}}},} \\ {\mspace{70mu}{ɛ_{aj} = {\sum\limits_{i}{A_{ij}^{T}ɛ_{ij}}}}} & {{ɛ_{bi} = {\sum\limits_{j}{B_{ij}^{T}ɛ_{ij}}}}} & \; \end{matrix}\quad$ 3. Multiply the diagonals of U and V by 1 + μ 4. Compute the inverse of V. Since V is not used again, it can be replaced by the inverse. 5 Find δ_(a) by solving (U* − WV*⁻¹W^(T))δ_(a) = ε_(a) − WV*⁻¹ε_(b) 6. Find δ_(b) by backsubstitution

Estimating the spatial relationship between the object and the camera, or the pose estimation problem, is a central issue in close-range photogrammetry and computer vision applications. The goal is to determine the rigid transformation that relates the object and camera reference frames. Typically, this rigid transformation is parameterized by a rotation matrix, R, and a translation, t. FIG. 8 illustrates this transformation.

The data used to solve this problem are a set of point correspondences -3D coordinates of the object, or “control points” and their 2D projections onto the image plane. Typically, the control points are expressed with respect to the object reference frame and their projections are expressed with respect to the camera reference frame. The algorithm described herein is based on the work by Hager, et al.

Given a set of at least 3 control points, {p_(i)}, the corresponding camera space coordinates, {q_(i)}, are given by: q _(i) =Rp _(i) +t The camera reference frame is chosen such that the origin is at the center of projection and the optical axis is in the positive z-direction. The control points are projected onto the plane in the camera reference frame where z=1, the so-called normalized image plane. In the camera reference frame, the control points are given by:

$\begin{Bmatrix} q_{ix} \\ q_{iy} \\ q_{iz} \end{Bmatrix} = \begin{Bmatrix} {{r_{1}^{T}p_{i}} + t_{x}} \\ {{r_{2}^{T}p_{i}} + t_{y}} \\ {{r_{3}^{T}p_{i}} + t_{z}} \end{Bmatrix}$ where r₁ ^(T), r₂ ^(T), and r₃ ^(T) are the rows of the rotation matrix, R. If a ray is drawn from the camera reference frame origin to the control point, it intersects the normalized image plane at the point V_(i)=(u_(i), v_(i), 1). This can be expressed as:

$u_{i} = \frac{{r_{1}^{T}p_{i}} + t_{x}}{{r_{3}^{T}p_{i}} + t_{z}}$ $v_{i} = \frac{{r_{2}^{T}p_{i}} + t_{y}}{{r_{3}^{T}p_{i}} + t_{z}}$ Or $V_{i} = {\frac{1}{{r_{3}^{T}p_{i}} + t_{z}}\left( {{Rp}_{i} + t} \right)}$

This is known as the collinearity equation. In classical photogrammetry, the collinearity equation is often used as the basis for solving the pose estimation problem. The pose is iteratively refined such that the image residual is minimized, i.e.,

$\min\limits_{R,t}{\sum\limits_{i = 1}^{n}\left\lbrack {\left( {{\hat{u}}_{i} - \frac{{r_{1}^{T}p_{i}} + t_{x}}{{r_{3}^{T}p_{i}} + t_{z}}} \right)^{2} + \left( {{\hat{v}}_{i} - \frac{{r_{2}^{T}p_{i}} + t_{y}}{{r_{3}^{T}p_{i}} + t_{z}}} \right)^{2}} \right\rbrack}$ Here, (û_(i), {circumflex over (v)}_(i)) are the coordinates of the control point observed in the normalized image plane.

This problem can also be expressed in terms of minimizing the overall residual in object space as shown in FIG. 9. The line-of-sight projection matrix is defined as:

${\hat{V}}_{i} = \frac{{\hat{v}}_{i}{\hat{v}}_{i}^{T}}{{\hat{v}}_{i}^{T}{\hat{v}}_{i}}$

When a scene point is multiplied by this matrix, it projects the point orthogonally to the line of sight defined by the image point {circumflex over (v)}_(i). In the presence of error, there will be a residual vector between the scene point, q_(i) and its orthogonal projection, i.e., e _(i) ={circumflex over (V)} _(i)(Rp _(i) +t)−(Rp _(i) +t)

Therefore, the optimal camera pose is that which minimizes the overall residual in object space, i.e.,

${\min\limits_{R,t}{\sum\limits_{i = 1}^{n}{e_{i}}^{2}}} = {{{\text{(}I} - {{\hat{V}}_{i}\text{)}\left( {{R\; p_{i}} + t} \right)}}}^{2}$

If the camera space coordinates of the control points could be obtained by other means, e.g. digitized with a FARO arm, then each control point is related by the rigid transformation: q _(i) =Rp _(i) +t Given at least 3 or more non-collinear control points, R and t can be obtained by solving the least-squares problem:

${\min\limits_{R,t}{{{R\; p_{i}} + t - q_{i}}}^{2}},{{{subject}\mspace{14mu}{to}\mspace{14mu} R^{T}R} = I}$

This type of constrained least-squares problem can be solved analytically using singular value decomposition (SVD). Defining the centroids of the camera and scene points as:

$\overset{\_}{q} = {\frac{1}{n}{\sum\limits_{i = 1}^{n}q_{i}}}$ $\overset{\_}{p} = {\frac{1}{n}{\sum\limits_{i = 1}^{n}p_{i}}}$ And defining the position of camera and scene points, relative to their centroids as: q′ _(i) = q−q _(i) p′ _(i) = p−p _(i) The sample cross covariance is

${\frac{1}{n}M},{{{where}\mspace{14mu} M} = {{q^{\prime} \cdot p^{\prime}} = {\sum\limits_{i = 1}^{n}{q_{i}^{\prime}p_{i}^{\prime\; T}}}}}$

If R* and t* are the optimal rotation and translation, then they must satisfy R*=arg max_(R) trace(R ^(T) M) t* = q−R* p Let (U,Σ,V) be a SVD of M, then the solution of R* is given by R*=VU^(T)

Thus, the only data required to calculate the optimal rotation matrix are the 3D coordinates (in camera and object space) relative to their centroids. The optimal translation is then a simple function of the optimal rotation and the centroids.

In one embodiment, an algorithm may be referred to as the orthogonal iteration (01) algorithm. This approach is to use the object-space collinearity error and restructure the problem so that it resembles the absolute orientation problem. The first step is to define the objective function based on object-space error, i.e.,

${E\left( {R,t} \right)} = {{\sum\limits_{i = 1}^{n}{e_{i}}^{2}} = {{\left( {I - {\hat{V}}_{i}} \right)\left( {{Rp}_{i} + t} \right)}}^{2}}$ Since the objective function is quadratic in t, the optimal translation can be calculated in closed-form as:

${t(R)} = {\frac{1}{n}\left( {I - {\frac{1}{n}{\sum\limits_{j}{\hat{V}}_{j}}}} \right)^{- 1}{\sum\limits_{j}{\left( {V_{j} - I} \right){Rp}_{j}}}}$ Defining q _(i)(R)={circumflex over (V)} _(i) [Rp _(i) +t(R)] Then the objective function can be rewritten in the following form:

${E(R)} = {\sum\limits_{i = 1}^{n}{{{Rp}_{i} + {t(R)} - {q_{i}(R)}}}^{2}}$ This equation has the same form as the absolute orientation problem; however, SVD cannot be used to solve for R because the sample cross covariance is also a function of R, i.e.,

${M(R)} = {{{q^{\prime}(R)} \cdot p^{\prime}} = {\sum\limits_{i = 1}^{n}{{q_{i}^{\prime}(R)}p_{i}^{\prime\; T}}}}$ where p′_(i)=p_(i)− p and q′₁(R)=q_(i)(R)− q(R) Instead, the following iterative approach is used. Given the k^(th) estimate R^((k)), t^((k))=t(R^((k)), and q_(i) ^(k)R^((k))p_(i)+t^((k)), the (k+1)^(th) estimate of R, R^((k+1)) is obtained by solving the following absolute orientation problem:

$\begin{matrix} {R^{({k + 1})} = {\arg\;{\min_{R}{\sum\limits_{i = I}^{n}{{{Rp}_{i} + t - {{\hat{V}}_{i}q_{i}^{(k)}}}}^{2}}}}} \\ {= {\arg\;{\max_{R}{{trace}\left\lbrack {R^{{(k)}^{T}}{M\left( R^{(k)} \right)}} \right\rbrack}}}} \end{matrix}$ Then, the (k+1)^(th) estimate of t is given by t ^((k+1)) =t(R ^((k+1))) This process is repeated until the estimate of R satisfies:

$R^{*} = {\arg\;{\min_{R}{\sum\limits_{i = I}^{n}{{{Rp}_{i} + t - {{\hat{V}}_{i}\left( {{R^{*}p_{i}} + {t\left( R^{*} \right)}} \right)}}}^{2}}}}$ within some specified tolerance.

Referring now to FIG. 10, photos of a vehicle involved in an accident may be analyzed. As shown in FIG. 10, method 300 may begin by receiving photos in a computer system at a central location and which can be evaluated to determine if they were taken with or without markers in the photos, as shown in block 305. More specifically, in decision diamond 310, the computer evaluates whether markers including, for example, control points and damage points, were included in the photos. Various markers may be used.

Referring still to FIG. 10, if the photos were taken with markers, control is passed to block 370, and the location of pose, control points and damage points can be determined automatically by finding the points in the photos via an algorithm. Automatic target recognition may be used in photogrammetric systems to reduce the amount of user interaction and to increase measurement accuracy. Targets can be automatically detected by examining variations in image intensity levels. Targets made from retro-reflective material are often used because they reflect significantly more light than a normal white textured surface. In some implementations, a circular retro-reflective target can be used. The first step in identifying the location of the target is to find areas within the image where the image intensity is rapidly changing. In some embodiments, the change in intensity can be defined by the steepness or the slope of the image intensity, which is also known as the image gradient. Images of retro-reflective targets are characterized by large image gradients.

After candidate regions (i.e., areas with large image gradients) have been identified, the next step is to check the geometry of the region. For circular targets, candidate regions can be verified by comparing them with an ellipse. Points on the perimeter of candidate regions are used to calculate the least-squares fit of an ellipse. Spurious regions are eliminated if the region perimeter does not fit the ellipse within some statistical measure.

Once candidate regions have been identified, the final step is to locate the center of the target. One commonly used estimate for this center is the intensity-weighted centroids, which is defined by

$\begin{pmatrix} x_{c} \\ y_{c} \end{pmatrix} = \frac{\sum\limits_{i = 1}^{n}{\sum\limits_{j = 1}^{m}{g_{ij}\begin{pmatrix} x_{i} \\ y_{j} \end{pmatrix}}}}{\sum\limits_{i = 1}^{n}{\sum\limits_{j = 1}^{m}g_{ij}}}$ where x_(i), y_(i) are the pixel coordinates and g_(ij) are the intensity levels within a (n×m) window covering the target region.

Note the pose helps set the frame of reference regarding which direction the pictures were taken relative to the vehicle. For example, the pose determines if the photo was taken from the driver's or passenger's side of the vehicle. Once the frame of reference is determined, then points of known location are identified to help determine the scale of the photo from known vehicle dimensions. These control points could include areas of the vehicle that are unlikely to be damaged in an accident such as the four corners of the windshield, the center of the wheel axle, etc. Finally, the points of potential damage can be located on each picture that is taken. For example, a standard set of points on the hood, grille and bumper may be identified on each of the different views of the front of a vehicle. Control is then passed to block 375 to store the estimate of the pose for each of the photos using the pose and control point data. Such stored pose estimates may be later used in determining a crush profile for the vehicle associated with the estimates.

Still referring to FIG. 10, if the photos were taken without markers, control is passed to block 315, to determine the appropriate vehicle example photos to display to allow manual input of the pose for each photo. Control is passed to decision diamond 320 to determine if the damage was to the rear of the vehicle. If the damage was to the rear of the vehicle, control is passed to block 325 to display the rear of an appropriate vehicle to allow pose to be selected. If the damage was not to the rear of the vehicle, then control is passed to decision diamond 330 to determine if the damage was to the front of the vehicle. If the damage was to the front of the vehicle, control is passed to block 335 to display the front of an appropriate vehicle to allow pose to be selected. If the damage was not to the front of the vehicle, control is passed to block 350 to display the side of an appropriate vehicle to allow pose to be selected.

Still referring to FIG. 10, control is passed to block 340 for user manual selection of the pose for each photo. Once pose has been selected, control is passed to block 345 to display control points for each of the photos. This may be performed based on the location of damage to the vehicle as well as the type of vehicle. For example, control points may be placed on the four corners of the back windshield for a vehicle involved in a rear collision. Once control points are displayed, control is passed to block 355 where the control points are selected by the user and manually placed on each vehicle picture. Control is passed to block 360 to display the crush points for the vehicle, which may also be based on damage location and vehicle type. Control is then passed to block 365 where the crush points are manually placed on each of the photos. Control is passed to block 370 to evaluate the photos for pose, control points and damage points. Control is then passed to block 375, discussed above.

Once the camera has been calibrated to determine its intrinsic parameters and the camera pose problem has been solved to determine the extrinsic parameters, the next step in the reconstruction process is to estimate the object's 3D structure. Photogrammetry uses a principle called triangulation to determine an object's three-dimensional coordinates from multiple photographs. Points are triangulated by finding the intersection of converging lines of sight, or rays as shown in FIG. 11.

Since the camera positions and orientations are only known approximately and there are errors in the measured image points, rays will often not back-project to a common intersection shown in FIG. 12. The techniques in this section describe how to estimate the 3D structure of the object. This estimate of structure, as well as the camera poses, are later refined in a process called bundle adjustment.

In each image, the 3D point, X, is projected to a measured image point, i.e., x=PX and x′=P′X. Here, P and P′ are 33 4 projection matrices given by P=K[R−RC] where K is the 3×3 camera calibration matrix, R is the 3×3 rotation matrix from object space to camera space, and C is the position of the camera with respect to the object.

The definition of vector cross product can be used to form a linear system. By definition, a cross product of two identical vectors is a vector of all zeros. Therefore, for each point, the cross product of the measured image point and the 3D point projected to that image is zero, i.e., x×(PX)=0 This results in the following linear system in X for each image. x(p ^(3T) X)−p ^(1T)0 y(p ^(3T) X)−p ^(2T)=0 x(p ^(2T) X)−y(p ^(1T) X)=0 where p^(iT) is the ith row of the projection matrix.

Using both images, a system of the form AX=0 can be composed, where

$A = \begin{bmatrix} {{x\left( {p^{3T}X} \right)} - p^{1T}} \\ {{y\left( {p^{3T}X} \right)} - p^{2T}} \\ {{x^{\prime}\left( {p^{\prime\; 3T}X} \right)} - p^{\prime\; 1T}} \\ {{y^{\prime}\left( {p^{\prime\; 3T}X} \right)} - p^{\prime\; 2T}} \end{bmatrix}$ Since only two of the three equations from each image are linearly independent, only two equations from each image are included. The solution of this system is calculated via singular value decomposition (SVD). The 3D point is then given by the smallest singular value of A. Specifically, if UDV^(T) is the singular value decomposition of A, then the solution X is the last column of V.

An alternative to the DLT method is to calculate the optimal depth. In this method, a 3D point is first calculated by back-projecting a ray through one of the measured image points for some distance, d, i.e.,

$\hat{X} = {C + {{dR}^{T}\begin{pmatrix} \frac{\left( {x - u_{0}} \right)}{f_{x}} \\ \frac{\left( {y - v_{0}} \right)}{f_{y}} \\ 1 \end{pmatrix}}}$ where the measured point is (x,y), the principal point is (u₀, v₀) and the focal lengths in the x and y direction are (f_(x), f_(y)).

This 3D point is then projected into the other image, i.e., {circumflex over (x)}′=P′{circumflex over (X)} The optimal depth is then the depth which minimizes the reprojection error in the other image.

The final step of the visual reconstruction is known as bundle adjustment. In this step, the visual reconstruction is refined to produce a jointly optimal structure (3D feature points of an object) and motion (camera pose). The name refers to the “bundles” of light which are reflected by the object's features into the camera lens, and are projected by the camera onto the 2D image surface. The bundles are optimally “adjusted” by varying both the 3D feature coordinates and the camera pose parameters, such that the total reprojection error between observed and predicted image points is minimized as shown in FIG. 13. Bundle adjustment is robust in that it is tolerant to missing data and it provides a true maximum likelihood estimate. Given initial estimates of the camera poses and 3D structure of the object, bundle adjustment is carried out using a sparse implementation of the Levenberg-Marquardt algorithm. Numerous problems in photogrammetry and computer vision are posed as non-linear least squares problems. Non-linear least squares problems have an objective function of the form

${F(p)} = {{\frac{1}{2}{\sum\limits_{i}^{M}\left\lbrack {f(p)} \right\rbrack^{2}}} = {{\frac{1}{2}{{f(p)}}^{2}} = {\frac{1}{2}{f(p)}^{T}{f(p)}}}}$

The vector f(p) is called the residual and the vector p=[p_(I), . . . P_(N)] is the set of control variables. Each element of the residual vector is the difference between an observation, x_(i), and a prediction, {circumflex over (x)}_(i). For example, in the case of bundle adjustment, the control variables are the 3D feature coordinates and camera pose parameters and the residual is the difference between observed image points and predicted image points. Non-linear least squares problems can be solved if they are overdetermined, i.e. if the number of observations, M, is greater than the number of control variables, N.

Like many other optimization problems, the necessary conditions for optimality are based on the first and second-order partial derivatives of the objective function with respect to the control variables. At a local minimizer, the gradient of the objective function should tend toward zero and the Hessian should be positive semidefinite.

The gradient of the objective function is a vector whose elements are the first-order partial derivatives with respect to the control variables, i.e.,

$g = {{F^{\prime}(p)} = \begin{bmatrix} \frac{\partial F}{\partial p_{1}} \\ \vdots \\ \frac{\partial F}{\partial p_{N}} \end{bmatrix}}$ Similarly, the Hessian of the objective function is a matrix whose elements are the second-order partial derivatives with respect to the control variables, i.e.,

$H = {{F^{n}(p)} = \begin{bmatrix} \frac{\partial^{2}F}{\partial p_{1}^{2}} & \frac{\partial^{2}F}{{\partial p_{1}}{\partial p_{2}}} & \cdots & \frac{\partial^{2}F}{{\partial p_{1}}{\partial p_{N}}} \\ \frac{\partial^{2}F}{{\partial p_{2}}{\partial p_{1}}} & \frac{\partial^{2}F}{\partial p_{2}^{2}} & \cdots & \frac{\partial^{2}F}{{\partial p_{2}}{\partial p_{N}}} \\ \vdots & \vdots & \ddots & \vdots \\ \frac{\partial^{2}F}{{\partial p_{N}}{\partial p_{1}}} & \frac{\partial^{2}F}{{\partial p_{N}}{\partial p_{2}}} & \cdots & \frac{\partial^{2}F}{\partial p_{N}^{2}} \end{bmatrix}}$ The Jacobian matrix is a matrix of all first-order partial derivatives of a vector-valued function. The Jacobian of the residual is defined as

$J_{i,j} = \frac{\partial{f_{i}(p)}}{\partial p_{j}}$ Based on the form of the objective function, the gradient can be expressed in terms of the Jacobian of the residual, i.e.,

$g = {{\frac{\partial}{\partial p_{j}}\left\{ {\frac{1}{2}{\sum\limits_{i}^{M}\left\lbrack {f(p)} \right\rbrack^{2}}} \right\}} = {{\sum\limits_{i}^{M}\left\lbrack {{f(p)}\frac{\partial{f_{i}(p)}}{\partial p_{j}}} \right\rbrack} = {J^{T}{f(p)}}}}$ Similarly, the Hessian can be expressed in terms of the Jacobian of the residual, i.e.,

$H = {F_{i,j}^{''} = {{\frac{\partial}{{\partial p_{j}}{\partial p_{k}}}{\sum\limits_{i}^{M}\left\lbrack {{f(p)}\frac{\partial{f_{i}(p)}}{\partial p_{j}}} \right\rbrack}} = {\sum\limits_{i}^{M}\left\lbrack {{{f(p)}\frac{\partial^{2}{f_{i}(p)}}{{\partial p_{j}}{\partial p_{k}}}} + {\frac{\partial{f_{i}(p)}}{\partial p_{j}}\frac{\partial{f_{i}(p)}}{\partial p_{k}}}} \right\rbrack}}}$ ${H = {{J^{T}J} + {\sum\limits_{i}^{M}\left\lbrack {{f(p)}{f^{''}(p)}} \right\rbrack}}}\mspace{346mu}$ When the residual is small, the higher order terms are negligible. Neglecting the higher order terms results in the Gauss-Newton approximation of the Hessian, i.e., H=J^(T)J

After completing the bundle adjustment, the reconstructed 3D points of the subject vehicle are compared with the geometry of an undamaged vehicle. For vehicles with front and rear damage, the difference between these points in the fore and aft direction yields the residual crush shown in FIG. 14. Likewise, for vehicles with side damage, the difference in the lateral direction yields the residual crush. Referring now to FIG. 15, shown is a flow diagram of the determination of a vehicle damage profile from a comparison of the damage measurements with an undamaged vehicle's measurements in accordance with one embodiment of the present invention. After the pose is estimated as described in method 300, control is passed to method 400 to refine pose and determine the crush points via triangulation. Method 400 may be performed in a computer system, e.g., a central system in which image data regarding a vehicle is received and processed. The initial pose estimate is refined using a bundle adjustment, as described above (block 405). Control is passed to block 410 to determine the location of the damage points using triangulation of the points on the different photos. Control is passed to block 415 to compare the damage point locations to the same points on a non-damaged vehicle. This non-damaged vehicle may be a reference or benchmark vehicle that is the same as the vehicle involved in the accident, or may be a substantially similar vehicle, such as having a common body type. Control is passed to block 420 to create a crush damage profile based on the comparison of the damaged versus the non-damaged vehicle. Control is passed to block 425 to display a crush profile on a representative vehicle, e.g., on a display of the computer system. Furthermore, the crush profile may be stored in the system for later use. For example, using the crush profile information various measures of accident severity, including potential for injury, likely damage to the vehicle and so forth may be determined. Furthermore, by obtaining the crush profile information using photographs of the vehicle via a photogrammetric analysis, embodiments may be efficiently used and may streamline an evaluation process, as photographs may be more easily obtained than further detailed information regarding damage to a vehicle. This information also may be used, e.g., to audit claim estimates to determine whether the claim repair estimate is appropriate in light of the crush profile and other information obtained via a photogramrnmetric analysis.

Once the crush damage profile has been determined, several data points of interest can be developed. One piece of information is to estimate the impact severity of the collision by calculating the change in velocity of the vehicle from the energy required to deform the vehicle. In one embodiment, the impact severity may be estimated in accordance with the methods described in U.S. Pat. No. 6,885,981 (herein the ‘981 patent’), which is commonly assigned with the present application, the contents of which are hereby incorporated by reference. Another piece of information is the Principal Direction of Force or PDOF of the collision.

FIG. 16 shows a flow diagram of the determination of PDOF from a crush profile generated by photogrammetry analysis in accordance with one embodiment of the current invention. As shown in FIG. 16, method 500, which may be performed in the same system used to receive photographic information and process the same to obtain crush data, begins at block 505, where individual components of the vehicle (e.g., hood, fenders, etc.) are evaluated for the direction in which they were moved in the accident by comparing the damaged location of the component versus the non-damaged location of the component, as present on a reference vehicle. As an example, the driver's side headlight and hood may have been shifted toward the passenger's side as a result of the impact. This result would be stored as a shift to the left. Control is passed to decision diamond 510 to determine if there were any components shifted. If components were shifted, control is passed to block 515 and the direction of each component's shift is stored. Control is passed to decision diamond 520 to evaluate if all components that are damaged are shifted in the same direction. If all components are shifted in the same direction as evaluated by decision diamond 520, control is subsequently passed to block 527 to store the resultant direction of shift. Control is subsequently passed to block 535 to evaluate input about the accident and vehicle motion before the accident, such as to determine which vehicle had a greater velocity at impact. Such input may correspond to user-supplied input regarding information associated with the accident, such as obtained by police reports, accident reports, event data recorder (EDR) data or other such means.

If decision diamond 520 results in components shifted in different directions, control is passed to evaluate the component shift (decision diamond 522). If the components are equally dispersed (e.g., a component point on the right side is shifted 2 inches to the right and the corresponding component point is shifted 2.5 inches to the left), then the PDOF would be estimated without additional input and control is passed to block 527 to store the resultant direction of shift and then to block 535 as discussed above. If the components are not equally dispersed (e.g., a component point on the right side is shifted 2 inches to the right and the corresponding component point is shifted 5 inches to the left), control may be passed to block 525 to gather more information about the component shift via requesting input from a user. Such information may include, for example, the nature of the impact (e.g., the impact was to a pole) or the nature of the component shift (e.g., are the components on the front of the vehicle shifted more to (a) the driver's side, (b) the passenger's side, or (c) neither side). Control is passed to block 540 to finalize estimate of the resultant component shift by collecting the results of the component analysis and determining the overall component shift pattern (e.g., driver's side shift, passenger side shift, or neutral shift) and control is subsequently passed to block 527 to store the resultant direction of shift. Control is subsequently passed to block 535, discussed above. Control then passes to block 530.

If no components were shifted, control is also passed to block 530. At block 530, the overlap of the damage patterns on the two vehicles may be optimized by comparing and aligning the damage profiles on each of the vehicles. Control is passed to block 545 to develop a preliminary estimate of PDOF for the first vehicle. In one embodiment, such a determination may be performed by examining the damage pattern, the overall component shift pattern and the input about vehicle motion before impact and determining the direction of the impact that is consistent with these data (e.g., a vehicle with significant component shift toward the passenger's side and crush to the head light and fender on the driver's side, and was moving slower than the other vehicle might be consistent with a 10 to 11 o'clock PDOF).

Control then passes to block 550 to develop a PDOF for the second vehicle which may be performed as previously described for the first vehicle. Control is passed to decision diamond 555 to evaluate consistency of the PDOF between the two vehicles. This consistency may be measured, e.g., based on damage information regarding the vehicles, accident characteristics and so forth. If PDOF is consistent between the vehicles, control is passed to block 565 to assign a final PDOF estimate for each vehicle, as well as generate a change in velocity (DV) for each vehicle by using the initial estimates for crush damage and PDOF to estimate the impact severity, e.g., in accordance with the methods described in the '981 patent. Otherwise, if the PDOF is not consistent as evaluated in decision diamond 555, the PDOF estimates are revised (as shown in block 560) by adjusting the PDOF estimate and/or the vehicle damage overlap optimization within reasonable parameters in an iterative process. Control is passed back to decision diamond 555 for reevaluation. When consistent PDOFs are determined, control passes to block 565, where PDOFs and change in velocity values may be finalized for the vehicles by using the initial estimates for crush damage and PDOF to estimate the impact severity, such as in accordance with the methods described in the '981 patent. These finalized estimates of both PDOF and change in velocity may be stored in the system for later use. Furthermore, this information may be reported to a user, e.g., via display or transmission to a remote location. Based on this estimated information, a user may use the information in considering whether claim information, such as property damage, personal injury and so forth is consistent with the PDOF and change in velocity. This estimated information can possibly be used in determination of liability for the accident as well by determining the point and angle of impact, pre-impact vehicle speeds, etc.

Referring now to FIG. 17, shown is a flow diagram of a method of estimating damaged components from a crush profile in accordance with one embodiment of the current invention. Method 600 may begin at block 605, where a comparison occurs between collisions of similar severity (i.e., change in velocity or DV) and direction (i.e., principal direction of force or PDOF) for the same or similar vehicles in a database and the vehicle under analysis. In some implementations, the database may be arranged in groups of files that group such similar vehicles, severity and direction. As one example, the database may be of a central system, such as an accident analysis firm that receives data regarding accidents from multiple client sources, such as insurance companies and others, and compiles the database continually as additional accident data is received, such that that database may be of a dynamic nature which may aid in obtaining accurate estimate information. Accordingly, method 600 may be used in some embodiments to act as an audit source for a claim associated with a vehicle accident to determine whether a claim level asserted is appropriate, e.g., if it falls within a range for similar accidents involving similar vehicles. Control is passed to block 610 to select and return all similar impacts, the components and operations (i.e., repair or replace) for the components for the similar collisions involving the same or a similar vehicle(s). Control is passed to block 615 to create a list of expected components and component operations for the current accident. This list may be created by identifying repair/replace components for the similarly damaged vehicles of the database. In one embodiment, the list may be created by combining or analyzing the repair/replace components of these other vehicles in accordance with a statistical analysis, a mathematical analysis or another such means. While not shown in FIG. 17, this list may be used to determine a list of repair/replace components for the vehicle, and in some embodiments, an expected cost for such repair/replacement.

Control is passed to decision diamond 620 to determine whether an independent estimate has been created. If an independent assessment of the components that need to be repaired or replaced has been developed, e.g., via a claims adjuster, control is passed to block 625 so this new assessment can be compared with the components that are predicted to need repair or replacement. Control is passed to decision diamond 630 to identify components on the estimate that are not on the expected list. If there are components that are not on the expected list, control is passed to block 635 to flag an exception with respect to the comparison. For example, any outliers may be indicated for further review and an adjuster or other remote entity may be notified, e.g., electronically (e.g., email or to the insurance company computer system) or similar communication. Control is then passed to decision diamond 645. If no components on the independent estimate are different than the expected list as determined at diamond 630, control is also passed to decision diamond 645 to determine if there are components on the expected list but not on the independent estimate. If there are components on the expected list that are not on the independent estimate, control is passed to block 640 with any outliers indicated for further review. If no components on the expected list are missing on the independent estimate, then control is passed to block 650 and the estimate is determined to have passed the audit. The audit may also be considered to pass so long as the report and the estimate are within a threshold amount (e.g., by number of components, matching score or other measures) of each other. Note also that the audit results can be stored along with an entry in the database to include information regarding the vehicle and accident, to further aid in analyzing of future accidents. While shown with this particular implementation in the embodiment of FIG. 17, it is to be understood that the scope of the present invention is not limited in this regard. For example, in other implementations in addition to or instead of analyzing crush pattern information, accident information from such similarly situated accidents may be compared to injury information, e.g., from a claim for the present accident to determine whether the contended injury level is in substantial relation with injuries occurring in these similarly situated accidents.

Referring now to FIG. 18, shown is a block diagram of a system in accordance with one embodiment of the present invention. As shown in FIG. 18, system 700 may be a computer system, such as a personal computer, server computer or other such system. System 700 may include a processor 710, which may be a microprocessor such as a central processing unit. Processor 710 is coupled via a memory controller hub (MCH) 720 that in turn is coupled to a memory 730 and a display 740, which may be a flat panel display, for example. During operation, memory 730 may store software in accordance with an embodiment of the present invention that includes instructions to perform the various techniques described herein.

As further shown in FIG. 18, MCH 720 is coupled to an input/output controller hub (ICH) 750. In turn, ICH 750 may be coupled to various peripherals 760 and a network adapter 770. Network adapter 770 may be used to communicate between system 700 and one or more other computers via a computer network, such as a local area network (LAN), a wide area network (WAN), or a wireless network, such as a wireless LAN (WLAN). Furthermore, network adapter 770 may communicate with remote systems, such as computers of an insurance company or other third party that desires to send vehicle and accident information (e.g., including photographic information) to system 700 for analysis in accordance with an embodiment of the present invention. Such communication may be via the Internet or another such computer network. In some implementations, these communications may be made secure, e.g., via encryption or in another secure format.

While the present invention has been described with respect to a limited number of embodiments, those skilled in the art will appreciate numerous modifications and variations therefrom. It is intended that the appended claims cover all such modifications and variations as fall within the true spirit and scope of this present invention. 

1. A computer-implemented method comprising: receiving a plurality of images of a vehicle involved in an accident and undergoing a computer-implemented impact severity analysis in a computer; evaluating, in the computer, the plurality of images for pose estimate information and storing a pose estimate for each of the plurality of images; refining the stored pose estimate for each of the plurality of images via a bundle adjustment; determining, in the computer, the location of damage points for the vehicle via triangulation using the plurality of images; generating, in the computer, a crush profile for the vehicle based on a difference between the location of damage points for the vehicle and corresponding points for a benchmark vehicle, the benchmark vehicle non-damaged; using the crush profile and a crush profile for a second vehicle involved in the accident, in the computer, to optimize an overlap of damage patterns on the first and second vehicles by comparison and alignment of the damage patterns on the first and second vehicles; determining, in the computer, estimates of a principal direction of force (PDOF) for the vehicle and the second vehicle based on the optimized overlap; and determining a final PDOF for the vehicle and the second vehicle based on an iterative adjustment of the PDOF estimates if the PDOF estimates are inconsistent, and storing the final PDOFs in the computer for later use.
 2. The method of claim 1, further comprising displaying the crush profile in a graphical user interface including a reference vehicle.
 3. The method of claim 1, further comprising receiving an input to select a reference pose to correspond to at least one of the plurality of images.
 4. The method of claim 3, further comprising selecting control points and crush points for at least one of the plurality of images.
 5. The method of claim 1, further comprising accessing the crush profile and selecting a plurality of stored crush profiles having corresponding crush profiles within a predetermined threshold of the crush profile.
 6. The method of claim 5, further comprising auditing a claim associated with the accident based on a comparison between the crush profile and the plurality of stored crush profiles.
 7. The method of claim 1, further comprising determining, in the computer, if a camera profile received with the plurality of images corresponds to a camera profile present in a profile database accessed by the computer.
 8. The method of claim 7, further comprising calibrating the plurality of images based on the camera profile if present, otherwise requesting additional information from a user of the camera.
 9. An article comprising a machine-accessible storage medium including instructions that when executed cause a system to: receive a plurality of images of a vehicle involved in an accident and undergoing a computer-implemented impact severity analysis in the system; evaluate the plurality of images for pose estimate information and store a pose estimate for each of the plurality of images; refine the stored pose estimate for each of the plurality of images via a bundle adjustment; determine the location of damage points for the vehicle via triangulation using the plurality of images; generate a crush profile for the vehicle based on a difference between the location of damage points for the vehicle and corresponding points for a reference vehicle, the reference vehicle non-damaged and substantially similar to the vehicle; use the crush profile and a crush profile for a second vehicle involved in the accident to optimize an overlap of damage patterns on the first and second vehicles by comparison and alignment of the damage patterns on the first and second vehicles; determine estimates of a principal direction of force (PDOF) for the vehicle and the second vehicle based on the optimized overlap; and determine a final PDOF for the vehicle and the second vehicle based on an iterative adjustment of the PDOF estimates if the PDOF estimates are inconsistent, and store the final PDOFs in the system for later use.
 10. The article of claim 9, wherein the instructions further enable the system to display the crush profile in a graphical user interface including the reference vehicle. 